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PREFACE 


This report provides an introduction to the theory of linear sys- 
tems defined over rings. The theory is presented with a minimum of 
mathematical details, so that anyone having some knowledge of linear 
system theory should find the work easy to read. The presentation 
centers on four classes of systems that can be treated as linear systems 
over a ring. These are (i) discrete- time systems over a ring of scalars 
such as the integers; (ii) continuous-time systems containing time delays; 
(iii) large-scale discrete-time systems; (iv) time-varying discrete- 
time systems . 

The material given here is an expanded version of a series of lec- 
tures given at NASA Ames Research Center, Moffett Field, California 
during the week of November 7, 1977. I am very grateful to Professor 
R. E. Kalman and Brian Doolin for the opportunity of giving these lectures. 
My gratitude also extends to Eduardo Sontag, Yves Rouchaleau, and Bostwick 
Wyman for countless hours of discussions on linear systems over rings. 

Part of the research work presented here was supported by the U. S. Army 
Research Office, Research Triangle Park, N. C., under Grant DAAG29-77-G- 


0085. 



1. INTRODUCTION 


1.1 Linear systems over fields. 

Since the late 1950's, a great deal of effort has been devoted 
to the study of linear finite-dimensional continuous-time and discrete-time 
systems specified by state equations defined over a field K. More precisely, 
given a triple (F,G,H) of n*n, n*m, p*n matrices over a field K, a in- 
input p-output n-dimensional time-invariant discrete- time system is 
defined by the state equations 


x(t + 1) = Fx(t) + Gu(t) 


y (t) = Hx (t) 


( 1 . 1 ) 


where teZ = set of integers. In the continuous-time case, with K = R = 
field of real numbers, a time-invariant system is given by the state 
equations 


dx (t) 
dt 


= Fx(t) + Gu (t) 


y(t) = Hx (t) 


( 1 . 2 ) 


where teR. In both (1.1) and (1.2), the state x(t), input u(t) , and 
output y(t) are column vectors over the field K. 
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Initial efforts in studying linear systems defined over a field 


K usually assumed that K was a particular infinite field, such as the 
field R of real numbers. Infinite fields are fields that contain a 
subset which can be put into a one-to-one correspondence with the set of 
integers. In addition to the field R, the field of rational numbers 
and the field of complex numbers are examples of infinite fields. 

There are several textbooks (e.g. , [1,2,3]) on the theory of time- 
invariant and time-varying linear systems defined over the field R. 

In a sequence of publications beginning in 1965, R. E. Kalman 
[4,5,6] developed an algebraic theory for discrete- time systems of the 
form (1.1) defined over an arbitrary (finite or infinite) field K. 

Hence, in addition to being applicable to systems over the real or 
complex numbers, Kalman's theory can be applied to systems over finite 
fields, which includes linear sequential circuits [7], 

1.2 Discrete- time systems over a ring of scalars. 

After the completion of his work on discrete-time systems 
over arbitrary fields, Kalman initiated the study of discrete-time sys- 
tems over rings . The notion of a ring is a generalization of the notion 
of a field: A ring is a set with two operations, called addition and multi- 

plication; however, unlike a field, a ring can contain nonzero elements 
that do not have a multiplicative inverse (see [8, Chapter II]). 

An example of a ring is the set of integers Z with the usual 
addition and multiplication operations. Since neZ has a multiplicative 
inverse belonging to Z if and only if n = 1 or -1, Z is not a field. 

Another example of a ring is the set K[z] of polynomials in a symbol z 
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with coefficients in a field K, with the usual operations of polynomial 
addition and multiplication. The only elements in K[z] that are invertible 
are the nonzero polynomials of degree zero. 

The first detailed results dealing with discrete- time systems 
over arbitrary commutative rings were derived in Rouchaleau's Ph.D. 
thesis [9] in 1972 under the supervision of R. E. Kalman. This work was 
concerned with the class of linear time- invariant discrete-time systems 
over a commutative ring A given by the state equations 


x(t + 1) = Fx (t) + Gu (t) 


(1-3) 


y (t) = Hx(t) 


where teZ, F, G, H are n*n, n*m, pxn matrices over the ring A, and x(t), 

u(t), y(t) are column vectors over A. 

An interesting example of a system over a ring is a system 

with the ring of scalars equal to Z . By definition, such a system 

accepts vector sequences over Z, processes these sequences using integer 

operations , and then outputs vector sequences over Z . These systems are 

of interest from a computational standpoint, since integer operations can 

be implemented "exactly" on a digital computer (assuming that magnitudes are 
12 

less than 10 for 12-digit precision) . Further, systems over Z appear 
in various applications, such as coding theory [10] and scheduling or 
sequencing problems [11] . 
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1.3 Continuous- time systems over rings of operators. 


In 1973, Kamen [12] showed that a large class of linear infinite- 
dimensional continuous-time systems can be represented by first-order 
vector differential equations defined over a ring of operators (due to 
publication delays, [12] did not appear in print until 1975). The ring 
approach developed in [12] and [13,14] applies to time-invariant and time- 
varying continuous-time systems containing pure and distributed time 
delays. Independently of this work, Williams and Zakian [15,16] constructed 
the same type of operator setting for a class of time- invariant continuous- 
time systems with pure time delays. 

In this section we shall define the operator framework for the 
class of systems with commensurate delays given by the dynamical equations 

= f F . x (t - ia) + £ G . u (t - ia) 

dt v „ i . i 

i=0 i=0 

(i.4: 

s 

y (t) = £ H . x (t - ia) 

i=0 1 

where a is a fixed positive number, the F., G., H. are nxn, nxm, pxn 

iii 

matrices over the field of real numbers R, and x(t), u(t), y(t) are column 
vectors over R. 

The n-vector x(t) in (1.4) is usually referred to as the state at 
time t; however, as a result of the delay terms, the actual state at time 
t is the function segment x(x), t - qa < T < t. To solve (1.4) for t > 0, 

we need to know the actual state at time t = 0, which consists of the func- 
tion segment x(x), -qa < x < 0. 
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In the mathematics literature, equations of the form (1.4) , or 
generalizations of (1.4), are usually treated as ordinary differential 
equations in a Banach or Hilbert space consisting of function segments 
(see [17,18,19]). In contrast, we shall view (1.4) as a vector differen- 
tial equation with coefficients belonging to a ring of delay operators. 

The constructions are as follows. 

Let V denote the linear space consisting of all R-valued functions 

defined on R with support bounded on the left (i.e., for each veV, there 

is a t eR such that v(t) = 0 for all t<t ). Let d:V-*V denote the a-second 
v v 

delay operator on V defined by (dv) (t) = v(t-a), veV. We can extend 

d to column vectors v = (v, ,. . . ,v ) ' over V by defining 

1 n 

(dv) (t) = (v (t-a) ,v (t-a))' 

l n 

where the prime denotes the transpose operation. 

Let R[d] denote the set of all finite sums X a.d 1 where a.eR and 

i 1 1 

(d 1 v) (t) = v(t-ia) , veV. With the usual operations of polynomial addition 
and multiplication, R[d] is a ring of delay operators. 

Now viewing the state trajectory x and the input function u as 
column vectors over V (or some subspace of V) , we can write (1.4) in the form 


dx(t) 

dt 


(F(d)x)(t) + (G(d)u) (t) 


y (t) = (H(d)x) (t) 


(1.5) 
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where F(d), G(d), H(d) are nxn, n^m, pxn matrices over the operator ring 
R[d] given by 


q r 

F(d) = l F.d\ G (d) = l G.d 1 , 
i=0 1 i=0 1 


H(d) = l H.d 1 


i=0 


Thus the class of time-delay systems given by (1.4) can be studied 
in terms of the vector differential equation (1.5) defined over the ring 
R[d] . We shall refer to the system (1.5) as a linear time-invariant 
continuous-time system over the ring of operators R[d]. 

EXAMPLE 1.1 Consider the time-delay system given by the dynamical 
equations 


dx (t) 

— = -x (t-a) + x (t) - x (t-2a) + u(t) 

at 1 2 2 

dx (t) 

— — = x (t) + x (t-a) - x (t) + u (t-a) 

at ii 2 

y (t) = x^ (t-a) - x^ (t-2a) + x 2 (t) 

This system of equations can be written in the form (1.5) with 



O 





-d 1-d 


1 


F (d) = 


G (d) = 


H (d) 


1+d -1 


d 




By exploiting the structure of the operator-ring representation 
(1.5), we can develop an algebraic theory for the class of time-delay 
systems given by (1.4). In particular, we can obtain constructive results 
given in terms of operations on matrices and vectors defined over the ring 
R[d] , or ring extensions of R[d] . Examples will be given in the next 
three chapters of these notes. 

As noted above, systems with noncommensurate delays and distributed 
delays can be studied in terms of differential equations over rings of 
operators. Also, as shown in [14] , initial data for equations of the form 
(1.5), or generalizations of (1.5), can be incorporated into the operator 
framework. 

Note that in the representation (1.5), the elements of the ring 
R[d] act on functions; whereas in (1.3) , the elements of the ring A 
act on values of functions. This is the reason for referring to (1.5) 
as a system over a ring of operators and (1.3) as a system over a ring 
of scalars. We shall show that both types of systems can be studied 
using the same techniques. 

1.4 Discrete-time systems over a ring of operators. 

In this section we shall show that there are discrete- time systems 
that can be treated as systems over a ring of operators. Consider the 
class of discrete- time systems given by the following dynamical equations 

x (t + 1) = f F.x(t-i) + £ G.u(t-i) 
i=0 1 i=0 1 

s 

y (t) = l H.x(t-i) 
i=0 1 

where teZ, the F.,G., H. are n*n, n*m, p*n matrices over a field K, and 
xix 

x(t), u(t), y(t) are column vectors over K. 
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Note the similarity between (1.4) and (1.6). In fact, if we 
proceed as we did above for continuous- time systems with time delays, 
it is clear that we can write (1.6) as a vector difference equation 

over the operator rina K(o) consistina of all nolvnomials in the ricrh+ - - 
shift operator a :x (t) ->x (t-1) with coefficients belonging to the field 
K. More precisely, we have that 


x(t + l) = (F( CT )x)(t) + (G(cr)uHt) 


y (t) = (H(a)x) (t) 


(1.7) 


where F(a), G(a), H(a) are matrices over K[a] given by 

f(ct) = Yf.o 1 , G(a) = Jg.o 1 , h ( a) = jH.a 1 

.1 .1 .i 

xii 

We shall refer to the system (1.7) as a linear time- invariant discrete- 
time system over the ring of operators K[a] . 

Via the representation (1.7) , we can study large (high-dimensional) 
discrete- time systems in terms of some subset of the set of all possible 
state variables. This is made possible by lumping together components 
with memory in the coefficient matrices F(a), G(a), H(a). In other 
words, the representation (1.7) can be viewed as an aggregated model for 
large discrete- time systems. 
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The representation (1.7) is a natural model for systems consisting 
of an interconnection of subsystems separated by time lags with the 
components of x(t) equal to the states of the subsystems. This is 
illustrated by the following example. 

EXAMPLE 1.2 Suppose that K = R. Consider the discrete- time 
system given by the following block diagram. 



Here x^ft) (resp. x (t)) is the state of the subsystem having transfer 
function l/(z+l) (resp. 1/z-l) ) . We have that 


x^tt+l) = -x^t) - x ^ ( t— 1 ) + u(t) 


x 2 (t+l) = * 2 (t) + x 1 (t-l) 


y(t) = x (t) 


These equations can be written in the form (1.7) with 


F(o) 




G(c) 



g 
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Thus we have a representation of order two over R[cr]. Defining x^ (t) = 

x (t-1) and x.(t) = x„ (t-1) , we also have a four-dimensional representation 
1 4 2 

over R given by 


x^ (t+1) 


~1 

0 

0 

-1~ 


x 1 (t) 


1 

x 2 (t+1) 


0 

1 

1 

0 


x 2 (t) 

+ 

0 

x 3 (t+1) 


1 

0 

0 

0 


x 3 ( t) 


0 

x 4 (t+l) 


0 

1 

0 

0 


x 4 (t) 


0 


y(t) = x 2 (t) 


The representation over R[a] can be very useful, because in some problems 
it is more efficient from a computational standpoint to work with the 
two-dimensional representation over R[cr] , rather than the four-dimensional 
representation over R. An example will be given in Chapter 4. 

1.5 Additional examples. 

In addition to the examples given above, there are many other 
examples of systems that can be treated as linear systems over rings. 

These include systems given by discretized partial differential equations 

[20,21] and linear two-dimensional digital filters viewed as linear 

systems defined over a ring of proper rational functions in one variable [22] . 

Linear time-varying systems can also be viewed as systems over rings, 
in this case a ring of time functions. However, the algebraic theory 
of time- varying systems differs from the theory of time-invariant systems. 

The primary reason for the difference stems from the fact that time-varying 
systems, viewed as systems over a ring of time functions, must be studied 
in terms of a special type of nonlinear transformation, called a pseudolinear 
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transformation 123]. In the last chapter of these notes, we develop this 


approach for the class of linear time-varying discrete-time systems. The 
theory is based on the concept of a semi linear transformation, which is 
an example of a pseudolinear transformation. 
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2. REPRESENTATION AND REALIZATION THEORY 


2.1 Abstract systems over rings. 

For the three classes of systems defined in Sections 1.2 - 1.4, 
it turns out that many structural and dynamical properties depend only on 
the triple (F ,G,H) of matrices appearing in the state equations. In 
other words, the particular form of the dynamical equations (e.g., 
discrete time or continuous time) is not always a primary consideration. 
Thus it is possible to develop a general theory of systems over rings 
specified in terms of a triple of matrices over a ring. This was the 
approach taken by Sontag in his survey paper [24] on systems over rings. 
In this section, we define the major components of the theory, beginning 
with the notion of an abstract system over a ring. 

DEFINITION 2.1 Let A be a commutative ring and let n, m, p be 
fixed positive integers. An abstract (free) linear time-invariant system 
over A is a triple (F,G,H) of nxn, nxm, pxn matrices over A. The integer 
n is called the dimension of the system (F,G,H). 

Although the concept of an abstract system does not include any 
explicit reference to dynamical behavior, an abstract system (F,G,H) 
can be interpreted as a dynamical system by associating a set of state 
equations specified in terms of F,G,H. For instance, (F,G,H) can be 
interpreted as a discrete- time system over the ring of scalars A, given 
by the state equations (1.3). 

Let (F,G,H) be an abstract system over the ring A. Suppose that 
C is a ring extension of A, i.e., A is a subset of C with the property 
that addition and multiplication in C, when restricted to elements in A, 
are identical to addition and multiplication in A. In other words, the 
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ring A is a subring of C. Then since the elements comprising F,G,H 
can be viewed as elements of C, (F,G,H) can be viewed as an abstract sys- 
tem over C. 

It is of particular interest to note that when A is an integral 
domain, there is a ring extension of A which is a field. Integral 
domains are commutative rings that do not contain divisors of zero. 

That is, if ab = 0 for some a,beA, then either a or b must be zero. 
Examples of integral domains are the ring of integers Z and the operator 
rings R[d] and K[cr] constructed in the preceding chapter. 

An integral domain A can be viewed as a subring of a field Q(A), 
called the quotient field of A. The field Q (A) is equal to the set of all 
formal ratios a/b where a,beA, b ^ 0, with addition and multiplication 
defined by 


V b l + a 2 /b 2 = (a i b 2 + a 2 b l )/(b l b 2 ) 

(a i /b l ) ( V b 2 ) = (a i a 2 )/( V b 2 ) 


For example, the quotient field of Z is the field of rational numbers 
and the quotient field of R[d] is the field of rational functions in d. 

Now given an abstract system (F,G,H) over an integral domain A, 
we can view (F,G,H) as a system over the quotient field Q(A). Thus 
there exists the possibility of utilizing results from the theory of 
systems over fields in the study of systems over rings. Although 
this approach is useful (e.g., in the realization problem), the 


- 13 - 



quotient- fie Id framework seldom gives a complete solution to a given 
problem, since in general the results are specified in terms of elements 
of Q(A), rather than elements of A. 

Next, we have the concept of an input/output sequence. 

DEFINITION 2.2 Let m,p be fixed positive integers. An input/output 
(i/o) sequence f over a commutative ring A is a sequence f = (J , J 2 ' •••) 
consisting of pxm matrices over A. The i/o sequence of an abstract system 
(F,G,H) or a dynamical system specified in terms of (F,G,H) is the sequence 
f = (J^, J 2 , ...) where = H (F 1 ^)G for i = 1,2,... 

In the remainder of this section, we shall show that the i/o sequence 
associated with the systems defined in Sections 1.2 - 1.4 completely charac- 
terizes the input/output behavior of these systems. 

(a) Consider the discrete-time system over the ring of scalars A 
with the dynamical equations (1.3). Solving (1.3) by iteration, we have 

that the state x(t) at time t resulting from initial state x(t ) at time 

o 

t < t and input u(t) , t s t is given by 
o o 

t-t t-1 . 

x ( t) = F °x ( t ) + l F X Gu(i), t > t (2.1) 

o . o 

l=t 

o 

If x(t^) = Of the output response is 

v _1 t-i-1 

y (t) — HF Gu (i) , t > t (2.2) 

i= t 

o 
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From (2.2) it is seen that the input/output behavior of the system is com- 


pletely determined by the sequence (J^,^,...) where J = HF 1 for i = 1,2, 
(b) Consider a continuous-time system over the operator ring R[d] 
with the dynamical equations (1.5). Assuming that x(t) = 0 for 
t < 0 and taking the (one-sided) Laplace transform of (1.5), we get 


Y (s) = H(e~ as ) (si - F (e as )) 1 G(e _as )U(s) 


where F(e aS ) , G(e aS ) , H(e aS ) are computed from F (d) , G(d) , H(d) by 
setting d = e aS , and where U(s) (resp. Y(s)) is the Laplace transform 
of u(t) (y(t)). Thus the transfer function matrix T(s,e aS ) is given by 


-as, 
T(s,e ) 


H (e aS ) (si - F (e _aS ) ) 1 G(e aS ) 


— 1 — 1 

Expanding (si - F (e )) into a power series in s , we have that 

OO 

_ , -as. v . -as, i-1, -as,„, -as, -i 
T (s , e ) = l H (e )F (e )G(e )s 

i=l 

Hence T(s,e aS ) can be determined from the sequence (J^(d) ,J 2 (d), ), 

where J.(d) = H(d)F 1 ^(d)G(d). 
x 
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(c) Consider a discrete- time system over the operator ring 
K[ct] with the dynamical equations (1.7). We shall specify the input/ 
output behavior using the formal z-transform defined as follows. 

Let z be a symbol. Given a function f:Z-»-K with f(t) equal to zero 

for t<0, we define the (formal) z-transform of f to be the formal power 

-l °° _ • 

series F(z ) = £ f(i)z 1 . 

i=0 

Now if we assume that x(t) =0 for t^O and take the z-transform 
of (1.7), we get 


Y (z _1 ) = T(z _1 )U(z L ) 

where T(z "S = H(z 1 ) (zl - F(z "S ) 1 G(z ^) is the transfer function matrix. 
Expanding (zl - F(z S) ^ into a formal power series in z \ we have that 

00 

T(z _1 ) = l H(z" 1 )F 1 ~ 1 (z _1 )G(z" 1 )z _;L (2.3) 

i=l 

Thus the sequence (J^cr), j^(o), ...), where J^(o) = HfcOF 1 ^(ajGfcf), 
determines T(z "S . However, it should be noted that the expansion (2.3) 
is not unique. Therefore, the input/output behavior of a discrete-time 
system over K[ct] cannot be characterized uniquely by a sequence (J^(o), J ^(o) r . 
over K[a] . 

2.2. Problem of realization 

In this section we present a general approach to realizability 
based on the concept of abstract systems. The results given here can 
be applied directly to discrete-time systems over a ring of scalars 
and continuous- time systems over a ring of delay operators. 
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DEFINITION 2.3 An abstract system (F,G,H) over A is a realization 
of an i/o sequence (3 ,3 over A if and only if J i = HCF 1 1 )G 
for i = 1,2,... A realization (F,G,H) is minimal if its dimension is 
minimal among all possible realizations. 

Let us first consider the realization of a scalar sequence f = 
( a i' a 2'..')' a i e A * To the se< 3 uence we associate the formal power 

series w(z 1 ) = £ a.z 1 in the symbol z with coefficients in the ring 

1=1 1 -1 . . 

A. The power series w(z ) is said to be rational if it can be written 

as a ratio of the form 


n-1 n-2 

-1 c r,_i z + C r,-9 Z +... + c n z + c n 

z J. x n ± n Z 1.0 

W(z ) = — —Z 

n n— 1 

z + e .z + ... + e.z + e. 


'n-1 


0 


(2.4) 


where the c . , e . e A . 

l l 

We then have the following results on realization. The proofs of 
these results are omitted, as they are easy generalizations of the field 
case. 

PROPOSITION 2.4 A scalar sequence is realizable if and only if 
its associated power series w(z "S is rational. 

PROPOSITION 2.5 Let f be a scalar sequence with associated power 
series given by (2.4). If the polynomials comprising w(z '*’) contain no 
common factors of degree > 1, then (F,g,h) is a minimal realization of f 


' h " f o c i •" c n-3 


where 


F = 








— — 

0 

1 

0 

0 

... o 


0 

0 

0 

1 

0 

... o 


0 

. 

• 


• 

• 


. 

• 

• 

• 


» 

• 

f g = 

• 

0 

0 

0 

* f 

’ * ! 


0 

-e 

_ 0 

-e 

1 

_e 2 

» • 

. . -e 

n-1 


1 
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The result in Proposition 2.4 can be extended to matrix sequences 
f = (j ,j ,...) as follows. Let W(z "S denote the formal matrix power 

1 2 oo 

— 1 ”1 P *■ 1 

series in z associated with f;i.e.,W(z )=) J.z . . By definition, 

i=l 1 

W(z ■*■) can be written as a matrix whose elements are scalar power series 
in z 1 . The matrix W(z is rational if each element of W(z 1 ) is 
rational. 

We then have the following result [24] . 

PROPOSITION 2.6 The matrix sequence f is realizable if and only if 
its associated power series is rational. 

The realization of matrix sequences can be approached by first reali- 
zing the elements of W(z "S , but the resulting realizations are seldom 
minimal. As will be seen, for a special class of rings, minimal realiza- 
tions of a matrix sequence f = (J can be constructed from the Hankel 

matrix B (f ) given by 


B (f ) 



It is well known [6] that a matrix sequence f over a field K is 
realizable if and only if the rank of B(f) is finite. The rank of B(f) 
is the smallest integer q such that all minors of B(f) of order greater 
than q are zero. If the rank of B(f) is finite, f has a minimal 
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realization of dimension equal to the rank of B(f). 

Realizability results for matrix sequences over fields can be applied 
to matrix sequences over an integral domain A: Given such an f, we can view 

f as a sequence over the quotient field Q(A) . Then since Q(A) is a field, 
f has a realization over Q( A) if and only if the Hankel matrix B(f) has 
finite rank as a matrix over Q (A) . But since we are seeking realizations 
over A, we would like to know when realizability over Q(A) implies realiz- 
ability over A. As proved in [9] , this is the case for the class of rings 
referred to as principal ideal domains (for more general results see 
[25,26,27] ) . 

A principal ideal domain (p.i.d.) is an integral domain A with the 
property that, for any subset S of A such that a + beS whenever a,beS 
and abeS whenever aeA, beS, there is an element aeS such that S = {ab:beA} 
(i.e., every ideal S of A is generated by a single element aeS ) . The 
ring of integers and the operator rings R[d] and K[o] are examples of 
p.i.d. ' s . 

Since realizability over Q(A) implies realizability over A when A 
is a p.i.d., we see that if a matrix sequence over Z has a realization 
over the field of rational numbers, there is a realization over Z; and if 
a matrix sequence over R[d] has a realization over the field of rational 
functions in d, there is a realization over R[d] . These are very interes- 
ting results. 

It is proved in [9] that a realizable matrix sequence f over a p.i.d. 
has a minimal realization of dimension equal to the rank of the Hankel 
matrix B(f) as a matrix over Q(A). In the p.i.d. case, minimal realizations 
can be computed from B(f) using Rouchaleau's algorithm [9]. The algorithm 
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yields a minimal realization over Q(A) using Silverman's formulas, from which 
a minimal realization over A is generated via a similarity transformation. 

The steps of a condensed version of this algorithm are given below. 

Let f = be a matrix sequence consisting of pxm matrices 

defined over a p.i.d. A. Suppose that B(f) has finite rank equal to n. 

Then a minimal realization of f over A can be constructed by carrying 
out the following steps. 

(1) Let C be a nxn submatrix of B(f) having rank n as a matrix over Q(A). 

(2) Let D be the nxnm submatrix of B(f) containing the same rows as C 
and the first n block columns of B(f). 

(3) From D construct a nxn matrix V as follows. Let b^ be the 

greatest common divisor of the elements in the first row of D. There is 

an A- linear combination v of the columns of D having b^ as first element; 

and for each column d. of D, there is a r.eA such that the first element 

i l 

of d. - r.v„ is zero. Define D, = [d, - r,v, d - r v, . . . d - r v_ ] . 
ill 1 111221 nmnml 

Apply the same procedure to working with the second row, which yields a 

column vector v_ and a matrix D_. Continue until [v. ,v~...v ] = V is constructed. 
2 2 1 ^ n 

(4) Let N be the nxm submatrix of the first block column of B(f) 
containing the same rows as C. 

(5) Let E be the pxn submatrix of the first block row of B(f) 
containing the same columns as C. 

(6) Let M be the nxn submatrix of B(f) sitting to the right of C. 

Then (F,G,H) is a minimal realization of f where 


F = (V 1 )M(C 1 )V, G = (V 1 )N, H =E (C 1 )V 


(2.5) 
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2.3 Example 


Consider the continuous-time system with time delays given by the 
transfer function matrix 


T(s,e S ) = 


2, , -s -s 

s +(e -l)s-e 




-s 

e s 


+ 1 


+ 


-2s 

e 


— s 

We want to compute a minimal realization of T(s,e ) given by a triple 
(F (d) , G (d) , H(d)) over the operator ring R[d] with 


T (s , e~ S ) = H (e S ) (si - F (e~ S ) ) ^(e" 3 ) 


— s 

Since the least common denominator (as a polynomial in s) of T(s,e ) 

— s 

has degree equal to two, it follows that T(s,e ) has a realization over R[d] 

of dimension 4. We shall compute a minimal realization using the above 

— s — 2. 

algorithm. First, expanding T(s,e ) into a power series in s , we get 


T(s,e S ) = 



-s 


-s 

„ -s 


-2s 


1 

e 

-1 

-e 

1+e 

-2 

G 

1 


0 

s + 

-s 


s + 

-2s 

-s 

-1 


__ e 

-1 


_-e 

e +1 J 


s ' 3 + 
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Then 


B(f) = 


1- 

-s 

1 -s 

-s 

» 

i 

1 

e 

1 ” e 

1+e 


-1 

0 

1 -s 

1 e 

-1 

i 

-s 

-e 

l+e~ S 

l -2s 
e 

1 

• 

i 

-s 


1 -2s 

-s . , 

i 

e 

-1 

* - e 

e +1 


• 

- 

l 

• 

i 

i 

• 

• 

- 

• 


. 


. 

. 



Since there is a realization of dimension 4, the rank of B(f) must be less 
than or equal to 4 , so it is not necessary to consider any more blocks 
in B ( f ) than those given above. Checking the minors of the 4 X 4 submatrix of 
B (f ) given above, we find that the rank of B(f) is 2. Thus there is a reali- 
zation of dimension 2. We then choose 


so that 
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Applying the procedure given in Step (3) to D, we get 


1 


V = 


-1 





We have that N = E = C and M = 


-s 

e 


-1 


— s 

Then from (2.5), we have the following minimal realization of T(s,e ) 



-d 

1+d+d 2 


1 

d 


1 

0 

F (d) = 

0 

1 

, G(d) = 

0 

1 

, H(d) = 

-1 

d 


In component form, the realization is given by 


dx x (t) 
dt 


-x '(t-1) + * 2 (t) + x 2 <t-l) + x 2 (t-2) 


+ u^t) + u 2 (t-l) 


dx 2 (t) 
dt 


= x 2 (t) + u 2 (t) 


Y l (t) = x l (t) 

y 2 (t) = -x 1 (t) + x 2 (t-l) 
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3. REACHABILITY, OBSERVABILITY, AND DUALITY 


We shall now study the concepts of reachability and observability 
for discrete-time systems over a ring of scalars and continuous-time 
systems over the operator ring R[d] . The results obtained here lead to 
a definition of reachability and observability for abstract systems over 
rings. In the last part of Section 3.3, we consider a concept of duality 
for abstract systems. 

3.1 Discrete- time systems over a ring of scalars. 

Consider the discrete-time system given by the state equations 


x(t + 1) = Fx (t) + Gu (t) 
y (t) = Hx (t) 


(3.1) 


where F,G,H are n*n, n»m, p*n matrices over the ring A. Let A n denote 
the set of all n-element column vectors over A. 

DEFINITION 3.1 The system (3.1) is reachable if for any xeA n , 
there is an integer N > 0 and inputs u(0) , u(l), ...,u(N-l) that drive 
the system from the zero state at time t = 0 to the state x at time t = N. 

The system is reachable in n steps if for any xeA n , the integer N can be taken 
to be n. 

In Definition 3.1, we have taken the initial time to be zero. This 

does not imply any loss of generality, since the system is time invariant. 

Let v ,v , . . . ,v be fixed elements belonging to A n . We say 
12 q 

that xeA n is an A-linear combination of v, ,v , . . . ,v if x can be written 

12 q 

in the form 


q 

x = y e.v, for some e.eA 
“ l l 

i=l 
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In terms of this concept, we have the following criterion for reachability. 

PROPOSITION 3.2 The following conditions are equivalent. 

(1) The system (3.1) is reachable. 

(2) The system (3.1) is reachable in n steps. 

(3) Every element of A n can be written as an A-linear combination 

n _ ^ 

of the columns of G,FG,...,F G. 

Proof. From (2.1), the state x(n) at time t = n starting from 
x(0) =0 is given by 

n-l 1 

x(n) = y F n 1 Gu(i) (3.2) 

i=0 

The equivalence of conditions (2) and (3) follows from (3.2). The 
equivalence of conditions (1) and (2) follows from the Cayley-Hamilton 
theorem [8, page 400] . 

COROLLARY 3.3 Let m = 1 (the single-input case). Then (3.1) is 
reachable if and only if the nxn matrix [G,FG,...,F n ^G] is invertible 
over A, which is the case if and only if the determinant of [G,FG,...,F n ^G] 
has an inverse in A. 

EXAMPLE 3.4 Suppose that (3.1) is a single-input system over the ring 
of integers Z. Since the only invertible elements of Z are +1 and -1, 
by Corollary 3.3 we have that the system is reachable if and only if the 
determinant of [G,FG,...,F n ^G] is equal to +1 or -1. Thus, given a system 
selected "at random", it is very unlikely that every element in Z n can be 
reached using integer-valued controls. 

Again consider the system (3.1) defined over the ring A. Let U de- 
note the n*mn matrix [G,FG,...,F n ^G] , and let ueA 1 ™ 1 denote the control 
vector u = [u(n-l) u(n-2) u(l) u(0)]'. It follows from (3.2) that u drives 
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the system to the state xeA n at time t = n if and only if u satisfies the 
equation x = Uu. If the system is reachable, in the single-input case there is 
a unique solution given by u = U ^x, where U is the inverse of U. In the 
multi- input case (m > 1) , the computation of a control u (if one exists) is in 
general a difficult problem. For a class of fields, we have the following 
result. 

v 2 

THEOREM 3.5 Let A = K, where K is a field with the property that ^ 

-1 for any a^K. Then the system (3.1) is reachable if and only if the nxn 
matrix U(U') is invertible over K, in which case a solution ueK 1011 of x = Uu is 
u = (U') [U(U’)]" 1 x. 

The above result is well known for the case in which K = R = field of 

real numbers. The usual proof for the K = R case extends to the class of 

v 2 

fields K with the property that ^ -1 for any a^eK. It is also well known 

that when K = R, the control u = (U ' ) [U (U ' ) ] -1 x minimizes |ju||, where || || 
denotes the Euclidean norm, and where u ranges over all solutions of x = Uu. 

In the ring case, invertibility of U(U') over A implies that the system 
is reachable, but the converse is not true: 

EXAMPLE 3.6 Consider the discrete-time system over Z given by x(t + 1) = 
x ( t) + u L (t) + u 2 (t), y (t) = Hx (t) . Here G = [1 1], and U = G. Since the 
columns of U generate Z , the system is reachable . But U (U ' ) = 2 , which does 
not have an inverse in Z. 

Now assume that A is an integral domain with quotient field Q(A) 
having the property that ¥ ~1 for any a^eQ(A). Suppose that the system 

(3.1), viewed as a system over Q (A) , is reachable. Then by Theorem 3.5, 
any x£A n can be reached by applying the control u = (U’ ) [U (U 1 ) ] -1 x, which 
in general is defined over Q(A). This is an acceptable solution if the 
control over Q(A) can be generated. 
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EXAMPLE 3.7 Again let (3.1) be a system over Z. Viewing (3.1) as 
a system over Q(Z), the field of rational numbers, we have that it is 
reachable if and only if the determinant of U(U') is nonzero. In this case, 
the control u = (U ' ) [U (U ' ) ] would be a vector of rational numbers in 
general . 

For systems over Z , a very interesting problem is the computation of 
integer- valued controls u that minimize ||u||, where u ranges over all integer- 
valued solutions of x =Uu. When the determinant of U(U') is equal to +1 or 
-1, an integer-valued solution minimizing ||u|| is u = (U')[U(U')] ^x. Unfor- 
tunately, invertibility of U(U') is too severe of a condition to be of 
much value . 

_ mn 

For systems over a p.i.d. (such as Z) , controls belonging to A can 

be computed by first putting U into diagonal form (the Smith form) using 
row and column operations (see [28, page 109]). The details of this proce- 
dure will not be considered here. 

We now consider the concept of observability. 

DEFINITION 3.8 The system (3.1) is observable if for any nonzero initial 
state x(0)eA n , there is an integer N > 0 such that the output response y(t) 
resulting from x(0) is nonzero for at least one value of t e{0,l, . . . ,N-1} . The 
system is observable in n steps if for any nonzero x(0)eA n , the integer N can 
be taken to be n. 

PROPOSITION 3.9 The following conditions are equivalent. 

(1) The system (3.1) is observable. 

(2) The system (3.1) is observable in n steps. 

(3) There is no nonzero x£A n such that 
x'[H , ,(F') (H'),...,(F') n_1 (h ' ) ] = 0. 
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Proof. From (2.1), the output response y(t) for 0 ^ t^n-1 resulting from 


initial state x(0)eA 


n 


can be expressed in the form 


y (0) 


H 

y (1) 

_ 

HF 

_y (n-l)_ 


hV" 1 ). 


(3.3) 


The equivalence of conditions (2) and (3) follows directly from (3.3). 

The equivalence of conditions (1) and (2) follows from the Cayley-Hamilton 
theorem . 

Let V denote the np x n matrix 


V = 


H 

HF 


H(F n_1 ) 


np 1 

and let yeA denote the response vector [y(0) y(l) ... y(n-l)] . By (3.3), 

the response vector y resulting from initial state xeA n at time t = 0 is 

given by y = Vx. We would like to be able to compute the initial state x from 

knowledge of y. In the case of systems over fields, this problem is dual to 

the problem of computing controls in the generation of states. In particular, 

we have the dual of Theorem 3.5. 

_ y 2 

THEOREM 3.10 Let A = K, where K is a field with ) .a. ^ -1 for any a.eK. 

^11 l 

Then the system (3.1) is observable if and only if the n x n matrix (V')V is 
invertible over K, in which case, given y = Vx for some xeK. n , x = (V'V) "*‘(V')y. 
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Proof. Recall that a system over a field is observable if and only if 
its dual, specified in terms of the matrices F' ,H’ ,G' , is reachable [5] . 

Then apply Theorem 3.5. 

Although the duality between reachability and observability does not 
extend to systems over rings (see Section 3.3), Theorem 3.10 can be extended 
to systems over an integral domain as follows. 

r 2 

COROLLARY 3.11 Suppose that A is an integral domain with 2 a - ^ "1 
for any a^eQCA). Then the system (3.1) over A is observable if and only if 
(V')V is invertible over Q(A), in which case, given y = Vx for some xeA n , 
x = (V 1 V) _1 (V ' ) y . 

Proof. It is easily verified that the system (3.1) is observable if and 
only if it is observable as a system over the field Q( A) . Then apply Theorem 3.10. 
3.2 Continuous-time systems over a ring of delay operators 

Consider the continuous -time system given by 


dx (t) 
dt 


(F(d)x)(t) + (G (d) u) (t) 


(3.4) 


y(t) = (H(d)x) (t) 


where F(d), G(d), H(d) are n*n, nxm, p*n matrices over the operator ring 
Rid] . 

2 n 

Given a fixed positive number t^, let L ([0,t^];R ) denote the Hilbert 
space of square- integrable functions on [0,t^J with values in R n . The norm 
||f|| of a function belonging to L 2 ( [0 , t^] ; R n ) is defined by 


l! f 


t l||f (t) || 2 dt 
J 0 
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where the norm in the integrand is the Euclidean norm. 

We can now define the notion of Euclidean reachability. 

DEFINITION 3.12 The system (3.4) is R n -reachable in time t^ > 0 
n 2 m 

if for any xeR , there is an input ueL ([0,t^];R ) that drives the system 
to the state x at time t = t^ with initial state function x(t) = 0 for t S 0. 

We shall give a necessary and sufficient condition for R n -reachability 
in terms of the following constructions. 

Let M(t) denote the inverse Laplace transform of the nxm matrix (si - 
„ , -as. . -1 , -as, 

F(e )) G(e ). it can be shown (see [14]) that the state x(t^) at time t 
resulting from the input ueL 2 ( [0,t^3 ;R m ) with x(t) = 0 for t S 0 is given by 


x(t 1 ) 



s) u (s) ds 


(3.5) 


Define the map X (t^) :L 2 ( [0,t^] ;R m )->- R n :u 




M(t, 


s)u(s)ds. 


We then have the following result which follows directly from (3.5). 

PROPOSITION 3.13 The system (3.4) is R n -reachable in time t if and 
only if the map X(t^) is onto. 

We can get an explicit condition for reachability in terms of the 
* 

adjoint X (t^)of X(t^) defined by 


* 

X 


(t 1 ) :R n ->-L 2 ( [0,t 1 J ;R m ) 


s)x, 0 < s ^ t^ 


Here we are using the result that the elements of M(t) are square- in tegr able 
on any finite interval 0 ^ t ^ t^. 
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I 


* n n 

The composition X (t ) X (t^) is a map from R into R given by 


XCt^X (t^ :x-*B (t^) x 


where 


B (t^) = J 1 M(t 1 - s) M' (t^ - s) ds 

The matrix B(t^) is the controllability gramian for time-delay systems. 

THEOREM 3.14 The system (3.4) is R n -reachable in time t if and only 

* -1 

if B(t^) is invertible, in which case the control u = X (t^)B (t^)x sets 

up the state xeR n at time t = t^. Further, this control minimizes |[u||, 
where u ranges over all solutions of x = X(t^)u. 

Proof. Follows from standard results in the theory of linear transfor- 
mations and their adjoints (refer to Luenberger's book [29]). 

Although B (t) is constructed from the matrices F(d) and G(d), we would 
like to have a criterion for R n -reachability specified directly in terms 

of F (d) , G(d). This can be accomplished by first noting that the system 

n * 

(3.4) is R -reachable in time t if and only if X (t^) is one-to-one, 

which is the case if and only if there is no nonzero xeR n such that x'M(t) = 0 

for 0 £ t ^ t . We then have the following result. 

1 

THEOREM 3.15 Given the system (3.4), suppose that F (d) = ^F . d* and 
r . i=Q L 

G(d) = T G.d 1 , where the F. and G. are matrices over R. Then the following 
i=0 1 

are equivalent. 
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(1) The system is R n -reachable in time t^ for some t > 0. 

(2) The system is R n -reachable in time t^ for any t > g(n-l)a + ra. 

(3) There is no nonzero xeR n such that 

x 1 [G(d) ,F(d)G(d) ,...,F(d) n_1 G(d)] = 0. 

Proof. Clearly, (2) 2^(1). We shall show that (1) implies (3): 

Suppose that there is a nonzero xeR n such that x' [G (d) ,F (d)G (d) , . . . ,F (d) n ^GCd)] 
Then by the Cayley-Hamilton theorem, x'F(d) 1 "''G(d) = 0 for all i > 1. Now by 
definition of M(t) , it can be expanded into the series 


M(t) = l (F (d) 1 ^(dm 1 X , t > 0 
i=l 


i-1 


Thus, x'M(t) = 0 for all t 5 0, so the system is not R -reachable. 

(3)^(2): Suppose that the system is not R n -reachable for t > q(n-l)a + ra. 

Then there exists a nonzero xeR n such that x'M(t) = 0 for 0 < t < q(n-l)a + ra. 

— 2 _ 

Now let A (t) denote the inverse Laplace transform of (si - F(e )) 

r 

Than M(t) =■ (AG(d)) (t) = £ A(t - ia)G.. Evaluating M(t) at t = ia and 

i=0 1 

noting that A(0) = I = nxn identity matrix, we have that x'G^ = 0 for i = 


0,1,2, ...,r. Hence x'G(d) = 0. Taking the first derivative of x'M(-y 

q r 

for t > 0, we have that x' (F(d)AG(d)) (t) = x' £ £ F . A (t - ia - ja)G. 

i=0 j^O 1 3 _ 

for 0 < t < q(n-l)a + ra. Evaluating this at t = ia + ja, we get 


= 0 


x'F^G. = 0 for i = 0,1,..., q and j = 0,.,...,r, which implies that x'F(d)G(d) = 0 
Continuing in this manner, we can show that x'F(d) 1 "'"GW) = 0 for i= 1,2,.. . ,n. 

n _ -J^ 

Now let U (d) denote the n><mn matrix [G (d) ,F (d) G (d) , . . . ,F (d) G(d)], 

and let R(d) denote the quotient field of R[d] . We then have the following 


sufficient condition for R -reachability. 
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COROLLARY 3.16 Suppose that the rank of U(d), viewed as a matrix 
over R(d) , is equal to n. Then the system (3.4) is R n -reachable. 

Proof. The rank of U(d) is equal to n if and only if there is no 
nonzero xeR(d) n such that x'U(d) = 0. Then apply Theorem 3.15. 

A generalized version of condition (3) in Theorem 3.15 was given by 
Sontag [24, page 28]. Results similar to those given in Theorem 3.15 
were derived by Williams and Zakian [16] . For a geometric version of these 
results, see Byrnes [30]. 

Unfortunately, the notions of R -reachability and R -controllability 
(driving x(t) to zero at some time t^ > 0) are of limited use in the 
design of control systems with time delays, since the value of x(t) at time t 
is not the complete state of the system. For example, the existence of 
an open- loop or closed- loop control which drives x(t) to zero at some time 
t 1 > 0 does not imply that x(t) will remain zero for all t > t . This of 
course is due to the storage of signals within the delay lines. 

In the literature on systems with time delays, conditions based on 
the concepts of exact or approximate function space reachability are usually 
considered [31] . These conditions are much stronger than R n -reachability 
since they deal with the problem of reaching (from zero) or controlling 
(to zero) R -valued function segments, rather than points in R n . 

It is interesting to note that the rank condition in Corollary 3.16 
is stronger than R 12 - reachability : 

EXAMPLE 3.17: Consider the system (3. 4) with 


1 0 


1 


1 1 


and G (d) = 


, so that U (d) = 


_0 1 


d_ 


d d 


- 33 - 



2 2 
There is no nonzero xeR such that x'U(d) =0, so the system is R -reachable. 

But the rank of U (d) is equal to one. Thus R n -reachability does not neces- 
sarily imply that the rank of U (d) is equal to n. 

A very interesting open question is whether or not the rank condition 
in Corollary 3.16 is equivalent to one of the various notions of function 
space reachability specified in terms of some topological structure. 

There is a much stronger condition than the rank condition given in 
Corollary 3.16. Namely, we can require that any xeR[d] n can be written 
as an R[d] -linear combination of the columns of U (d) . In the single-input 
case, we can do this if and only if the determinant of U (d) is equal to a 
nonzero element of R (which is seldom the case) . As will be seen in 
the next chapter, this condition is necessary and sufficient for "pole 
assignability" using state feedback of the form u = -B(d)x, where the feedback 
matrix B (d) is a lx n matrix over R[d] . 

The last topic of this section is R n -observability . 

DEFINITION 3.18 The system (3.4) is R n -observable in time t > 0 
if for any nonzero initial state x(0)eR n with x(t) =0 for all t < 0, 
the output response y(t) resulting from x(0) is nonzero for some range of 
values of t belonging to [0,t^]. 

The next result states that R n -observability is dual to R n - 
reachability . 

PROPOSITION 3.19 The system (3.4) is R n -observable in time t if 
and only if the dual system, specified by the matrices F(d)', H(d)', G(d)', 
is R n -reachable in time t . 

Proof. Follows from the theory of adjoints [29] . 
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COROLLARY 3.20 Given the system (3.4), suppose that F (d) = 
q s 

I F.d 1 and H(d) = £ H.d 1 . Then the following are equivalent. 
i=0 1 i=0 1 


(1) The system is R -observable in time t^ for some t^ > 0. 

(2) The system is R n -observable in time t^ for any t^ > q(n-l)a + sa. 

(3) There is no nonzero xeR n such that 


x’ [H(d) ' ,F(d) >H(d) (F(d) ') n_1 H(d) '] = 0 


Dualizing Corollary 3.16, we have that the system (3.4) is R n - 
observable if the rank of [H (d) ' ,F (d) 'H (d) 1 , . . . , (F (d) ' ) n H(d)'], 
viewed as a matrix over the quotient field R(d) , is equal to n. The relation- 
ship (if there is one) between this rank condition and the various notions 
of function space observability is not known at present. 

Suppose that (3.4) is R n -observable in time t^ . Given the output 
response y(t) on [0,t^] resulting from initial state x(0)eR n , by dualizing 
Theorem 3.14 we can derive an expression for x(0) in terms of y(t) on [0,t ]. 

The straight-forward details are omitted. 

3.3 Abstract systems. 

The results given in Sections 3.1 and 3.2 suggest the following notions 
of reachability and observability for abstract systems. Here we also define 
canonical systems and minimal systems. 

DEFINITION 3.21 Let ( F,G,H) be an n-dimensional abstract system 
over the commutative ring A. Then (F,G,H) is reachable if any xeA* 1 can 
be written as an A-linear combination of the columns of G,FG,...,F G. 

The system is observable if there is no nonzero x£A n such that x'fH'jF'H', 

. . . (F' ) n "*"H' ] =0. The system is canonical if it is reachable and observable. 
The system is minimal if its input/output sequence f = (HG,HFG, ... ,H(F 1 1 )G,...) 
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cannot be realized by a system over A with dimension strictly less than n. 

When A is a field, it is known [6] that a system is canonical if and 
only if it is minimal. When A is not a field, it is still true that a 
canonical system is minimal, but the converse is not necessarily true: 

EXAMPLE 3.22 Consider the one-dimensional system (l,d,l) over R[d] . 

The system is obviously minimal. But since R[d] ^ {d7T (d) : tt (d) eR [d] } , it is not 
reachable, so it is not canonical. 

We define the dual of (F ,G,H) to be the system(F' ,H' ,G' ) . If (F',H',G') 
is reachable, it can be shown that (F,G,H) is observable. But it is not 
necessarily true that observability of (F,G,H) implies reachability of 
(F' ,H' ,G' ) : 

EXAMPLE 3.23 Consider the system (F,G,H) over Z with F = 1 and H = 2. 

There is no nonzero xeZ such that Hx = 0, so the system is observable. 

But not every xeZ can be written in the form x = (H')a = 2a for some aeZ, so 
the dual is not reachable. 

Thus reachability of the dual is a stronger condition than observability 
of the given system. In the next chapter, we shall see that the stronger 
condition is useful in the construction of state observers. 

A reachable system whose dual is also reachable is said to be a split 
system [24]. For split systems, it is possible to approach the design of 
regulators by feeding back an estimate of the state obtained from a state 
observer. An interesting question is whether or not a given input/output 
sequence can be realized by a split system. For results on this, see Sontag [24] 
and Byrnes [30] . 
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4. CONTROLLERS AND OBSERVERS 


4 . 1 Introduction . 

In the first part of this chapter, we consider the construction of 
feedback controllers for abstract systems over rings. The general theory 
is illustrated by examples involving the three classes of systems defined 
in Sections 1.2 - 1.4. In the last section of the chapter, we consider the 
construction of observers by dualizing the results on feedback controllers. 

4.2 Assignability. 

Let i! be a ring extension of the ring A. Given a fixed element 

0 belonging to £2 , let A[0] denote the subring of £2 consisting of all finite 
q 

v i 0 

sums of the form 2, a. 9 , where a.eA and 0 =1. Given a symbol z, let A[z] 

i=0 1 1 

denote the ring of polynomials in z with coefficients in A. Finally, let p 
denote the map 

p :A[z]-*A[0] : ^ a.z'''-)- £ a.0^ 
i=0 1 i=0 1 

The element 0 is said to be transcendental over A if the map p is 
one-to-one. That is, there does not exist a nonzero polynomial Tr(z)eA[z] 
such that p(ir(z)) = it( 0) =0. If 9 is transcendental over A, the rings A[z] 
and A [ 0] are isomorphic (i.e., p is onto and one-to-one, and P + ^3^ = 

p (ir^) p (tt 2 ) + p ( tt 3 ) for any 7T 3 eA [ z ^- 

If 0 is not transcendental over A, it is said to be algebraic over A. 

If 0 is algebraic over A, an element of A[0] does not have a unique expression 
as a polynomial in 0 with coefficients in A. Nevertheless, we can still 
say that beA is a zero of ir(0)eA[0] if there exists a 3(0)eA[0] such that ir(0) 
= (0 - b) 3(6) • 

Let (F , G, H) be an abstract system over A. As before, we assume that 
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F is nxn, G is nxm, and H is pxn. Given a mxn matrix B over A, the system 

(F-GB,G,H) will be referred to as a closed-loop system formed from (F,G,H). 

The matrix B is called the feedback matrix (or controller) . Now given 

0efi=>A, let det(0I - F + GB) denote the determinant of 01 - F + GB, where I 

is the n x n identity matrix. It can be shown that det (01 - F + GB) is an 

n-1 

element of A[0] of the form 0 + J a.0 , a.gA. In terms of these construc- 
ts 1 1 

tions , we have the following concepts . 

DEFINITION 4.1 The triple (F,G,0) is coefficient assignable if 

for any b^/b^, . . . ,b n _^ belonging to A, there is a B over A such that 

n-1 

det(0I - F + GB) = 0 n + £ b.0 . The triple (F,G,0) is zero assignable 

i=0 1 

if for any e ,c 2f ..., c belonging to A, there is a B over A such that det(0I - F + GB) 

= (0 - c. ) (0 - c ) ... (0 - c ) . 

12 n 

As indicated below, for the three classes of systems defined in 
Sections 1.2 - 1.4, zero (or coefficient) assignability implies that by 
employing state feedback, we can specify the "asymptotic" behavior of the 
state response resulting from arbitrary initial states with zero input. 

(a) Consider a discrete-time system over a ring of scalars A given 
by the dynamical equations 


x(t + 1) = Fx (t) + Gu (t) 
y (t) = Hx (t) 

We can feed back the state x(t) by setting 

u(t) = -Bx (t) + r (t) 


(4.1) 


(4.2) 


where B is the feedback matrix defined over A and r (t) is an external input 
or disturbance. Combining (4.1) and (4.2), we have that the closed-loop 
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system is given by 


x(t + 1) = (F - GB)x(t) + Gr (t) (4.3a) 

y(t) = Hx (t) (4.3b) 

Let a 1 denote the left-shift operator defined by (a ^f) (t) = f(t + 1), 
where f is a function on Z with values in A. Then viewing A as a subring 
of A[cr ^] , we can write (4.3a) in the form 

(a _1 I - F + GB) x = Gr (4.4) 


From a well-known result in matrix theory [8, page 334] , we have that 

[det(a _1 I - F + GB)][0 _1 I - F + GB] _1 = Adj (0 _1 I - F + GB) (4.5) 


where Adj (a I - F + GB) is the transpose of the matrix of cofactors of 
a I - F + GB. It follows from (4.4) and (4.5) that if the triple (F,G,o 1 ) 
is zero or coefficient assignable, by selecting the feedback matrix B, 
we can specify the behavior of the free response x(t) (i.e., r(t) = 0 for 
t > 0) as t-*°°. In particular, by choosing B so that det(c ^1 - F + GB) = 
a n , we have that the free response x(t) is zero for t > n, starting from 
any initial state x(0)eA n . This is often referred to as "dead-beat" control. 

(b) Consider the discrete-time system over the operator ring K[a] 
given by 


x(t + 1) = (F (a) x) (t) + (G (aj u) (t) 
y (t) = (H(a)x) (t) 


(4.6) 
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Setting u(t) = -(B(cr)x) (t) + r(t), we have that the closed-loop system is 
given by 


x(t + 1) = (F (a) - G(a)B(a)x) (t) + (G(a)r)(t) 


(4.7a) 


y (t) = (H(a)x) (t) 


(4.7b) 


where B(a) is a matrix over K[a] . In this case, the feedback signal 

-(B(o)x) (t) consists of delayed versions of x(t). 

Let a 1 denote the inverse of a, so that a ^ is the left-shift 

operator. Now we can view K[a] as a subring of the ring K[a, a 1 ] consisting 

n l n , 

) a. .a a with the a. . eK. It is important to note 
^ i=0 j=0 13 13 

that a is not transcendental over K[cr] . To see this, consider the 

polynomial ir(z) = az - le(K[cr]) [z] . We have that Tr(a ) = cr(a ) -1 = 0, 

so the map p defined above is not one-to-one. It follows that the ring 

K[cr,a is not isomorphic to (K[a])[z], 

Viewing (4.7a) as an operator equation over K[a,a ^] , we have that 

(a -1 l - F (a) + G(a)B(a))x = G(a)r (4.8) 

As in the case of systems over A, it follows from (4.8) that if the triple 
(F (a), G(a) , a 1 ) is zero or coefficient assignable, we can specify the 
behavior of the free response x(t) as t + ®. In fact, if B(a) is chosen so 
that the determinant of o ^ - F(a) + G(a)B(cs) is equal to a , the free 
response x(t) will be zero for t ^ t , where t^ is some positive integer. 
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(c) Consider the closed- loop continuous-time system over R[d] 


given by 


dx(t) 

dt 


= (F(d) - G(d)B(d)x) (t) + (G (d) r) (t) 


(4.9a) 


y(t) = (H(d)x)(t) 


(4.9b) 


As in the previous example, the feedback signal -(B(d)x)(t) consists of 
delayed versions of x(t). 

Letting D denote the derivative operator, we can view R[d] as a 
subring of R[d,D] which consists of delay differential operators of the 


form ££a. . d 1 D' ) acting on some (unspecified) space of functions (see [14] 
ij 13 

for details) . As shown in [12] , D is transcendental over R[d] , so R[d,D] is 
isomorphic to the ring (R[d])[z] of polynomials in z with coefficients in 
R[d] . 

Viewing (4.9a) as an operator equation over R[d,D] , we have that 


(DI - F (d) + G (d) B (d) ) x = G (d) r 


(4.10) 


In this case, if the triple (F (d) ,G (d) ,D) is zero or coefficient assignable, 
by choosing B(d) we can get the free response x(t) to converge to zero at 

ctt 

an exponential rate e for any a > 0 (see [14]). 

4.3 Single-input case. 

Let (F , G,H) be an abstract system over A, where now G is an n-element 
column vector over A (i.e., m = 1, which is the single-input case). Given 0 e (2, 
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where ft is some ring extension of A, in this section we shall present 
necessary and sufficient conditions for the triple (F,G,0) to be zero or 
coefficient assignable. Our first approach to this problem is based on the 
concept of cyclicity which we now define. 

DEFINITION 4.2 A n x n matrix F over a commutative ring A is said 
to be cyclic with generator geA* 1 if every element of A n can be expressed 
as a finite A-linear combination of g,Fg , . . . ,F 1 g , . . . 

It follows from the Cayley-Hamilton theorem that a n*n F is cyclic 
with generator g if and only if the elements g,Fg,...,F n ^g generate A R ; 
i.e., every element of A n can be written as an A-linear combination of 
g,Fg,...,F n ^g. Hence F is cyclic with generator g if and only if the 
determinant of the matrix [g,Fg,...,F n 1 g] has an inverse in A. We also 
have the following result. 

PROPOSITION 4.3 Let (F,G,H) be a single-input system, so that G = geA n . 
Then (F,9 ,H) is reachable (Definition 3.21) if and only if F is cyclic with 
generator g. 

As we now show, cyclicity is equivalent to the existence of a 

particular canonical form. As before, let z be a symbol. Given a n*n matrix 

n-1 

n v i 

F over A, we have that det (zl - F) = z + > a.z for some a.eA. Define 

i=0 1 
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PROPOSITION 4.4 The matrix F is cyclic with generator g if and only 
if there exists a n*n invertible matrix P over A such that F = P '*’FP and 
g = P g, where F and g are given by (4.11) . If F is cyclic with generator 
g, P can be taken to be the matrix [g,Fg,...,F n "'"gl [g,Fg, . . . , (F) n "'"g] \ 

This result is an easy generalization of the field case, so the proof 
is omitted. Using Proposition 4.4, we can prove the following result relating 
cyclicity and assignability. 

THEOREM 4.5 Let (F ,g,H) be a single-input system and let 6 be a fixed 
element of with 0 transcendental over A. Then the following are 

equivalent. 

(1) F is cyclic with generator g. 

(2) The triple (F,g,6) is coefficient assignable. 

(3) The triple (F,g,0) is zero assignable. 

Proof. Clearly, (2)^(3). It follows from Sontag [24, Proposition 4.3] 

that (3) implies that (F,g,H) is reachable. Hence F is cyclic with generator 

g. So the proof is completed if it is shown that (l)-^(2) : Given b^ ,b^ , . . . ,b n _^ 

belonging to A, let B denote the row vector [b -a b -a., . . -b , - a , ], 

0 o 1 1 n-1 n-1 

n-1 

where det(zl - F) = z + £ a.z 1 . Then det(0I - F + gB) = 

n-1 . i=0 1 

0 + 2. b.0 , where F,g are given by (4.11). If F is cyclic with generator g, 

i=o 1 

it follows from Proposition 4.4 that det(0I - F + gB) = det(0I - F + gB) 
where B = B(P ^) . Thus the triple (F,g,0) is coefficient assignable. 

By Theorem 4.5, if 0 is transcendental over A, the triple (F,g,0) is 
zero or coefficient assignable if and only if the determinant of [g,Fg,...,F n ^g] 
has an inverse in A. Unfortunately, this condition can be very severe. For 
example, in the case of a single-input continuous-time system over R[d] , the 
triple (F (d) ,g (d) ,D) is zero or coefficient assignable if and only if the 
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determinant of [g (d) , F (d) g (d) , . . . ,F (d) n '*’g(d)] is a nonzero element 
of R (which is seldom the case) . This limitation can be overcome by 
considering feedback matrices defined over a ring extension C of R[d] . 

If C contains a "large" subset of elements that are invertible in C, 
but not in R[d] , assignability with respect to C (i.e., B is over C) 
will be a much weaker condition than assignability with respect to R[d] . 

An interesting example of a ring extension of R[d] is the ring 
consisting of all proper rational functions in d - ^ which satisfy a stability 
criterion (see Sontag [24] for details) . It may seem reasonable to take 
C to be the quotient field R(d) , since every nonzero element of R(d) is 
invertible. However, in general it is not possible to implement feedback 
matrices defined over R(d) since they may contain noncausal or unstable 
elements. For instance, it is not possible to implement the ideal predictor 
given by d \ 

When 0 is algebraic over A, cyclicity of F is no longer necessary 
for assignability (see Example 4.8) . In this case, a necessary and 
sufficient condition for coefficient assignability can be derived using 
the identity 


det (01 - F + gB) = det(0I - F) + B[Adj(0I - F)]g (4.12) 


where Adj (01 - F) is the transpose of the matrix of cofactors of 01 - F 
(see [14] ) . 

PROPOSITION 4.6 The triple (F,g,0) is coefficient assignable if and 
only if there is a n*n matrix W over A such that 

R 1 


W [Adj (01 - F)]g = 


(4.13) 


0"- 1 
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Proof. Suppose that there is a W satisfying (4.13). Given b_,b, ,...,b 

0 1 n— . 

belonging to A, let B = [b - a„ b, - a. . . * b , - a ,]W, where 

,0011 n-1 n-1 

n-1 

det(0I - F) = 0 n + £ a.9 1 . It follows from (4.12) that det(0I - F + gB) = 

n-1 . i=0 1 

n. r» i 

0 +2. b.0 . Conversely, suppose that (F,g,0) is coefficient assignable. 

i=0 1 

Then for i = 1,2, ... ,n, there is a row vector B^ such that det(0I - F + gB^) 

-j ^ th 

- det(8l - F) = 0 . Let W denote the nxn matrix with i row equal to 

B^. Then W satisfies (4.13). 

COROLLARY 4.7 Suppose that F is cyclic with generator g. Then 
(4.13) is satisfied wtih W = P where P is the matrix defined in Proposition 
4.4. 

Proof. It is easily verified that Adj (01 - F)g = [1 0... 0 n where 

F,g are given by (4.11). Using F = P ‘'‘FP, g = P ^g, we have that W = P ^ 
satisfies (4.13). 

EXAMPLE 4.8 Consider the discrete-time system over R[o] with 



-1 -a 


l 

F (a) = 


, g(a) = 



__o 1 


0 

i— 


This is the system given by the block diagram in Example 1.2. For this 
example, the determinant of [g (a) ,F (a) g (a) ] is equal to a, which does 
not have an inverse in R[a] . Thus F(a) is not cyclic with generator g(o). 
But since o ^ is algebraic over R[cr] , cyclicity is not necessary for assign- 
ability. By Proposition 4.6, (F(a),g(o),o 1 ) is coefficient assignable 
if and only if there is a W over R[a] such that 

W[Adj(a 1 I - F (a) ) ]g(cr) = W 


-1 


- 1 


1 

-1 


(4.14) 
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mi mu 


hi i ■ ii 


Multiplying both sides of (4-14) by a, we have that (F(o),g(o), a 1 ) 
is coefficient assignable if and only if there is a W over R[g] such that 

(4.15) 



Since l-o and o are relatively prime elements of R[cr], there is a W 
satisfying (4.15). Further, a solution W can be computed using the Euclidean 
algorithm. This yields 


W 


a 1 

o + l 1 


Hence (F(o),g(o), a "*■) is coefficient assignable. Let's compute B(o) 

-1 -2 -1 

so that det(cr I - F (a) + g (o) B (a) ) = a . We have that det(o I - F(o)) 

-2 2 
= a - 1 + a . Thus 


B(0) = [b Q -a 0 b^-a^] W 



B(o) = [0+l-o 2 0-(0)] 

0 

1 


_? + l 

1 

3 

B(o) = [-o + a 1 

2 

- cr ] 



In this case, the feedback signal -(B(o)x)(t) is equal to x^(t-3) - x^(t-l) 
+ x 2 (t-2) - x 2 (t) . 

As noted in Example 1.2, this system also has a four-dimensional 
representation over R given by (1.8). Letting F,g denote the coefficients 
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^ A A A ^ A A ^ A 

of (1.8), we have that the rank of [g,Fg, (F) g, (F) g] is equal to four, 
so (F,g,o ■*") is coefficient assignable. It is interesting to note that 
in this framework, the computation of feedback matrices using the canonical 
form (4.11) requires that we work with 4><4 matrices over R; in particular, 

A A A A A A A 

it is necessary to invert [g,Fg, (F) 2 g, (F) 3 g ] . In general this procedure 
appears to be less efficient from a computational standpoint, than the one 
given above which utilizes the Euclidean algorithm to compute a W satisfying 
(4.13). This point is currently under investigation. 

4.4 Multi-input case. 

Given an abstract system (F,G,H) over A, in this section we assume 
that G is nxm with m > 1 (the multi-input case) . As before, let 0 be a 
fixed element of We then have the following two results on zero assign- 

ability. 

THEOREM 4.9 If 0 is transcendental over A, the triple (F,G,0) 
is zero assignable only if (F,G,H) is reachable. 

THEOREM 4.10 If the ring A is a p.i.d., reachability of (F,G,H) 
implies that (F,G,0) is zero assignable. 

Theorem 4.9 (resp. Theorem 4.10) follows from the work of Sontag [24] 
(Morse [32]). It should be noted that there is a constructive proof of 
Theorem 4.10 based on the diagonalization of matrices defined over a 
p.i.d. (see [32]). Instead of pursuing this, we shall consider the problem 
of coefficient assignability using the concept of cyclicity. 

PROPOSITION 4.11 Given (F ,G,H) and 0efi, suppose that there is a 
feedback matrix L over A such that F-GL is cyclic with generator g = Gu 
for some ueA m . Then (F,G,0) is coefficient assignable. Further, there 
exists an n x n invertible matrix P over A such that, for any bQ,b^,...,b n ^ 
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belonging to A, there is a feedback matrix B over A such that 


P _1 (F-GB) P = 


Proof. Suppose that F-GL is cyclic with generator g = Gu. Write 
n-1 

n v i — — 

det(0I - F + GL) =0 + / a.0 , and let F,g denote the matrices given by 

1=0 1 - -l - -i 

(4.11). Then by Proposition 4.4, F =P (F-GL)P,g = P g, where 

P= [g, (F-GL)g,..., (F-GL) n_1 g] [g,Fg, ..., (F) n-1 g] _1 (4.17) 

Now given b Q ,b^, . . . ,b^_^ belonging to A, let 

B = L - uq (P _1 ) (4.18) 

where q = [a Q -b 0 a^ . . . a n . f b n . 1 l . 

n-1 

It follows that det(0I - F + GB) = 0 n + £ b.0 1 , so (F,G,0) is coefficient 

i= ° 1 -1 

assignable. Again applying Proposition 4.4, we have that P (F - GB) P 
is equal to the right side of (4.16). 

The result given in Proposition 4.11 has a very interesting interpreta- 
tion for continuous- time systems over R[d] (or discrete-time systems over K[o] ) 
By choosing the b^ to be elements of R (or K) , we can construct a closed- loop 
system that is equivalent to a system (F ,G,H) with F over R (or K) and with 
any desired eigenvalues of F. Here equivalent means that the state x(t) of 
(F,G,H) is related to the state x(t) of the closed-loop system by the transfor- 


0 

0 

0 

-b. 


1 

0 

0 

-b. 


0 

1 

0 

-b„ 


0 

0 


0 

0 

1 

-b 


n-1 


(4.16) 
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II 

~ -1 

mation x(t) = (P x) (t) , where P is given by (4.17). This construction will 
be illustrated by an example later. 

To utilize the above construction on coefficient assignability, we 
need to consider conditions for the existence of a feedback matrix L such 
that F-GL is cyclic with generator g = Gu for some ueA m . When A is a field, 
it is known [33] that reachability is necessary and sufficient. When A is 
a ring, reachability is necessary, but it is not sufficient. In particular, 

Sontag [24] has shown that when A = Z or R[d] , reachability is not sufficient. 

A necessary and sufficient condition for the existence of a cyclic 
F-GL is given in the following theorem (the proof is omitted) . 

THEOREM 4.12 There exists a matrix L over A such that F-GL is cyclic 
with generator g = Gu for some ueA m if and only if there are elements 
UqjU^, . . . , u n _ 1 eA m such that the elements g^g^, • • - '5 n _i generate A n , where 
g Q = Gu q and g^ = Fg^_^ + Gu ^ for i = l»2,...,n-l. Given Uq,u^, . . . , u n _^ 
and ^q'^ 2.' ' ' ‘ r< ^n 1 sat isfying these condtions, L can be taken to be the 
matrix - [u ,u 2 , . . . ^^,0] [g Q ,g lf . . . ,g n _ 1 ] 1 . 

The result in Theorem 4.12 specializes to a result obtained by 
Hautus [34] in the case when A is a field. When A is a field, there is 
a constructive procedure (due to Heymann [33]) for finding elements u g , u ;l ' * " • ,u n -i 

and g ,g ,g ^ satisfying the conditions in Theorem 4.12. When A is a ring, 

a procedure that yields a solution whenever one exists has not been found as 
yet. However, there is a procedure involving trial-and-error , which seems 
to work well in examples for which the dimension is not large. The steps 
are as follows. 
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Step 1. If the condition in Theorem 4.12 holds, there exists a 

basis {a ,a . ...,a } for A n with a, = Gu for some u eA m . When A is a 
1 2 n 10 0 

p.i.d., a basis {a, ,a„, — ,a } with a, = Gu„ can be computed (assuming one 
1 2 n 1 0 

exists) by putting G into Smith form. Let g^ = Gu^. 

Step 2. Define the map 


P :A 


■+ A 


n-1 


i=l 


a. a. 
i r 



Now compute a basis {g ,g 2 »--.,8 } for A° 1 with g = P^ (Fg Q + Gu^) 

for some u^ £ A m . Again, when A is a p.i.d., there is an algorithm for com- 
puting such a basis (assuming that one exists). If no such basis exists, 
it is necessary to repeat Step 1 in order to compute another g^ = Gu Q . Given 
such a basis with (Fg Q + Gu^) , let = Fg Q + Gu^. 

Step 3. Define the map __ 


P 2 :A ' 


n-1 


n-2 r 1 b.g. 
A : l li 

i=l 



Compute a basis {y, ,Y_,...,Y „} for A n 2 such that v = P P (Fg + Gu ) 

1 2 n-2 1 1 21 1 2 

for some u^eA 111 , and so on. If possible, continue until g^ ^ is constructed. 
4.5 Example. 

Consider the continuous-time system over R[d] given by 


dx (t) 

— rr — = -x (t) + x (t) + u (t) + u (t) 

dt 12 12 

dx (t) 

— — — = x (t) + x (t-a) + u (t) + u (t-a) 
dt 1 2 12 
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In the operator notation, we have that 



-1 1 


1 1 

F(d) = 


and G(d) = 



_ 1. d 


_1 d 


Using the procedure given in Section 4.4, we shall first attempt to construct 

a matrix L(d) such that F (d) -G (d)L (d) is cyclic with generator g = G(d)u for 
2 

some ueR[d] . 

We have that {a^c^}, where = [1 1]' and = [0 1]', 

2 

is a basis of R[d] , with = G(d) [1 0] ' = g Q . Then P^tFCdig^ = 1 + d. 

2 

Now we must find u^eRfd] , such that 1 + d + P^(G(d)u^) has an inverse in 
R[d] . This is satisfied with u^ = [0 -1]'. Then g^ = F(d)gQ + G(d)u^ = [-1 

Thus, 


L(d) = -[ U;L 0] [g^g-^ 


0 0 

.5 .5 


This gives 


-1.5 .5 


F (d) -G (d) L (d) = 


l-.5d . 5d 


By construction, F (d) -G (d) L (d) is cyclic with generator g 

Now let's compute a feedback matrix B(d) so that det(DI - 
2 

D + 3D + 2. We have that 


1 

F (d) + G (d) B (d) ) = 


[g, (F-GL)g] 


1 -1 

1 1 


1 ] ' . 
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I 


II II 


II I I II II I I II I 


Using (4.17), we get 


P = (.5) 


1-d 2 

d-5 2 


and from (4.18), 


B = L - uq(P ■*■) 


where u = 


and g = (.5) [-d-5 -3-d] 


This yields 


O 'y 

r_ 5-d d +4d+7 


B = (1/8) 


Finally, we get 


F (d) - G (d) B (d) = (1/8) 


d 2 -17 


-d -4d-3 


d 2 -4d+3 -d 2 -7 


By construction, the closed-loop system is equivalent to a system whose 
free (unforced) behavior is given by 


dx^ (t) 
dt" ' 


x 2 (t) 


dx 2 (t) 

dt 


= -2x 1 (t) - 3x 2 (t) 
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4.6 Observers. 


Let (F,G,H) be an m- input p-output n-dimensional abstract system 
over A, and let L be a n*p matrix over A. The system (F-LH,G,I) , 
where G = [G L] and X is the nxn identity matrix, is an observer for 
the system (F,G,H) . The system (F-LH,G,I) can be interpreted in the usual 
manner. For example, if F,G,H are the coefficient matrices of a discrete- 
time system over a ring of scalars A, the observer is given by the dynamical 
equations 

x(t + 1) = (F-LH)x(t) + G 

y(t) = x(t) 

where u (resp. y) is the input (output) of the given system. 

Now let (F-LH,0, I) denote the free (unforced) system associated with 
(F-LH, G , I) , and consider the dual of (F-LH,0,I) given by (F ' - (H‘ ) (L 1 ) ,1 ,0) . 

The key point to note here is that the dual (F'-(H') (L'),I,0) can be viewed 
as a closed-loop system constructed from the dual (F',H',0) of (F,0,H) 
with the feedback matrix equal to L'. Therefore, the design of observers 
can be approached by considering the design of state-feedback controllers 
for the dual system (F 1 ,H' ,0) using the above results. Note that in order 
to apply the above results in those cases for which 0 is transcendental 
over A, it is necessary that the dual be reachable. As noted in the preceding 
chapter, reachability of the dual is a stronger condition than observability 
of the given system. 
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5. LINEAR TIME-VARYING SYSTEMS 


5 . 1 Introduction . 

In this chapter we present an algebraic theory for the class of linear 
time-varying discrete-time systems. The theory, which is taken from [35] , 
is based on the concept of a semilinear transformation that is derived 
from the given system. Using the notion of a cyclic semilinear transformation, 
we develop a (control) canonical form which is then applied to the construc- 
tion of state-feedback controllers. 

5.2 System description and properties. 

Let K be a fixed field and let A denote the set of all functions defined 
on Z with values in K. With addition and multiplication given by 


(a + b) (t) = a(t) + b (t) 

a,beA 

(ab) (t) = a(t)b(t) 

A is a commutative ring. It should be noted that A contains divisors of 
zero, so it is not an integral domain. 

Let a denote the right-shift operator on A, defined by (era) (t) = 
a(t-l). For any a,beA, a(a + b) = era + ab and cr(ab) = (era) (erb) . The 
operator a has an inverse a 1 = left-shift operator on A. 

We then have the following notion of a system over A. 

DEFINITION 5.1 Let m,n,p be fixed positive integers. A m-input 
p-output n-dimensional linear time-varying discrete-time system over the 
ring of time functions A is a triple (F,G,H) of nxn, nxm, pxn matrices over 
A, together with the dynamical equations 


x ( t + 1) = F (t) x (t) + G (t) u (t) 
y (t) = H (t) x (t) 
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(5.1) 


III I 



where x(t)eK n is the state at time teZ, u(t)eK m is the input at time t, 
and y(t)eK^ is the output at time t. 

We shall work with a modified version of the standard equations 
(5.1): Apply the right-shift operator to both sides of the first equation 

given by (5.1) and let D(t) = F(t-l), E(t) = G(t-l). This gives the 
following equations 


x(t) = D (t) x (t-1) + E (t) u (t-1) 
y (t) = H (t) x (t) 


(5.2) 


From here on, we shall work with the representation (5.2). The system 
given by (5.2) will be denoted by the triple (D,E,H) . 

Our first objective is to characterize systems over A in terms of 
a semilinear transformation. First, we need the following constructions. 

Let A n denote the set of all n-element column vectors over A. With 
componentwise addition 

(v n v_ . . . v ) ' + (w, w_ . . . w ) ' = (v n + w, . . . v + w ) ' 

12 n 12 n 11 nn 

and scalar multiplication 

a(v, v„ . . . v ) ' = (av, av. . . . av ) ' , aeA 
12 n 12 n 

A n is a module over the ring A. A module is similar to a vector space, 
except that the scalars come from a ring rather than a field. 

Given a nxn matrix M over A, define the operator S:A n ->A n :v^-S (v) = m(cfv). 
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where (cfv) (t) = v(t-l) . For any v,weA n , we have that S (v + w) = S (v) + S (w) , 
so S is additive. But for aeA,veA n , 

S(av) = Mcr(av) = M(cra) (ctv) = (cra)S(v) 

which shows that S is not linear with respect to the A-module structure on 

A n (linearity requires that S(av) = aS(v)). The operator S is called a 

semilinear transformation relative to a . 

Now given an n-dimensional system (D,E,H) over A, the operator 

S:A n -tA n :v-*D(av) will be referred to as the semilinear transformation (s . 1. 1. ) 

of the system (D,E,H). We shall first show that the state and output 

responses can be expressed in terms of the system's s.l.t. 

Given t eZ and x eK n , let x denote the element of A n defined by x (t) 
o o o o 

= x q when t = t and x Q (t) = 0 when t / U Let S° = I = identity operator 

on A n , and for i = 1,2,..., define (S^TE) (t) = D(t) (S 1 ^E) (t-1) . Then the 

solution x(t) of (5.2) resulting from initial state x eK. n at initial time t 

o o 

and input u(t), t ^ t , is given by 

~ t-1 _ 

x(t) = (S °x ) (t) + I (S 1 E) (t)u(i) , t > t (5.3) 

o . o 


If the initial state x^ is zero, the output response is 


t-1 


(t) = l (HS fc 1_1 E) (t)u(i) , t > t 


i=t 


(5.4) 
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Note that the expressions (5.3-4) for the state and output responses 
closely resemble the expressions (2.1-2) for the state and output responses 
in the time- invariant case. The primary difference between the two frameworks 
is that in the time- invariant case, dynamical behavior is specified in terms 
of the linear transformation defined by the matrix F; whereas, in the time- 
varying case dynamical behavior is given in terms of the semilinear transfor- 
mation S. 

From (5.4), we see that the input/output behavior of the time-varying 
system (D,E,H) is completely characterized by the sequence (J (t) , ^ (t) , . . . ) 
where J^(t) = HS i "*"E (t) . This observation leads to a realization theory 
based on a Hankel-matrix approach. The details are under development. 

The next concept is reachability. 

DEFINITION 5.2 The system (D,E,H) is reachable at time teZ if, 
for any x(t)eK n , there is an integer N > 0 and inputs u(t-N) ,u(t-N+l) , . . . ,u(t-l) 
that drive the system from the zero state at time t-N to the state x(t) at time 
t. If there is a fixed N for all x(t)eK n , (D,E , H) is reachable in N steps 
at time t. The system (D , E , H) is reachable in N steps at all times if it 
is completely reachable in N steps at each teZ. 

A necessary and sufficient condition for reachability at time t can 
be expressed in terms of the system's s.l.t. S as follows. 

PROPOSITION 5.3 The following conditions are equivalent. 

(1) (D,E,H) is reachable at time t. 

(2) There is an integer N > 0 such that (D,E,H) is reachable 
in N steps at time t. 

N— 1 

(3) There is an integer N > 0 such that the rank of |E,SE,...,S E] (t) 
is equal to n. 
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Proof. Follows from (5.3) and the fact that K n is a finite-dimensional 
vector space. 

5.3 Cyclicity. 

In this section we shall develop a structure theory based on a notion 
of cyclicity. This concept is defined in terms of the system's s.l.t. 
as follows. 

An s.l.t. S:A n -*-A n is cyclic with generator geA n if every v£A n can 

be written as a finite A- linear combination of g,Sg,...» that is, A n 

can be generated from g,Sg,. Since A* 1 consists of n-element column 

vectors over A, it follows that S is cyclic with generator g if and only 

q— 1 

if there is a positive integer q such that the elements g,Sg,...,S g 
generate A n . In constrast to the theory of linear transformations, it can 
happen that the smallest possible value of q is strictly greater than n. 

There is no Cayley-Hamilton theorem for s.l.t. 's, which would guarantee that 


q = n. 

EXAMPLE 5 . 4 Suppose that D 

Then 


1 t 


0 


. Let g = 


1 1 


_ 1 _ 



t 


2t-l 

S (g) = 

1 

and S (g) = 

t 


The rank of 


[g , Sg , S g] 


0 t 2t-l 
lit 


2 

is equal to 2 for all teZ, which shows that the elements g,Sg,S g generate 
2 

A . Since the determinant of [g,Sg] is equal to -t, the elements g,Sg do not 
2 

generate A . 
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An s.l.t. S:A n ->-A n is n-cyclic with generator geA n if every veA* 1 
can be written as an A-linear combination of g,Sg,...,s n '*"g. This is the 

case if and only if the nxn matrix [g,Sg,...,S n 1 g] is invertible over A. 

The next result interconnects cyclicity and reachability. 

PROPOSITION 5.5 Let (D,e,H) be a single-input (m = 1) n-dimensional 

system. Then (D,e,H) is reachable in N steps at all times for some N > 0 

(resp. reachable in n steps at all times) if and only if S is cyclic 

(resp. n-cyclic) with generator e. 

Proof. Follows from Proposition 5.3 and the definition of cyclicity. 
As proved in [35] , n-cyclicity is equivalent to the existence of a 
canonical form that is identical to the form considered in Section 4.3. 

The constructions are as follows. 

Suppose that S :A n ->-A n : v-H3 (ov) is n-cyclic with generator g. Let u 
denote the matrix [g,Sg,...,S n ^g] and define 


a = a ... a ]' = -u 1 (S n g) 

0 1 n-1 


Let S denote the s.l.t. on A defined by S (v) = D(av) where 


D = 


0 

0 


1 

0 


0 0 


0 

1 


-1 -2 

-a a. -a a„ 


0 

0 


-n+1 
-a a 


(5.5) 


n-1 
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Finally, let U = [g,Sg, . . . ,S n ^g] where g = [0 0 ... 0 l]'eA n . 

- -1 - -1 -1 
PROPOSITION 5.6 D = P D(aP) and g = P g where P = U(U) and 

(OP) (t) = P(t-l) . 

If we apply the coordinate transformation x(t) = P ^"(tJxCt) to the 
dynamical equations (5.2), we have that 

x (t) = 5(t)x(t-l) + P _1 (t)E (t) u (t-1) 

y (t) = H (t) P (t) x (t) 

where D is given by (5.5). Thus n-cyclicity implies that the given system 
(D,E,H) is equivalent to a system (D,E,H) with D in the form (5.5). This 
result is useful in the study of state feedback, which we now consider. 

Given the system (D,E,H) with the dynamical equations (5.2), let 
u(t-l) = -B (t) x (t-1) + r(t-l), where B (t) is a mxn feedback matrix over 
A and r(t) is an external signal. The resulting closed- loop system is 
given by the triple (D-EB ,E ,H) , with the dynamical equations 

x(t) = [D (t) - E(t)B(t) ] x (t-1) + E (t)r (t-1) 
y(t) = H (t) x (t) 

The s.l.t. of the closed-loop system will be denoted by S . Then S (v) = 
(D-EB) (crv) for all veA n . 

Now suppose that there is a feedback matrix T over A such that 
is n-cyclic with generator g = Eu for some ueA m . Then [35] for any ele- 
ments b o ,b l' ‘ * ,b n-l belonging to there is a feedback matrix B over A 
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such that 


(5.6) 

where P is a n x n invertible matrix over A. 

This result implies that if we select the to be time independent, 
we can construct a closed- loop system that is equivalent to a system 
(D,E,H) with D time independent and with any desired eigenvalues for D. 

In particular, if we set b^ = 0 for all i, the free response of the closed- 
loop system will become zero after n steps (i.e., we have a dead-beat 
control system) . 

Let us first consider conditions for the existence of a feedback 
matrix T such that is n-cyclic with generator g = Eu. Then we will give 
the constructions for setting up (5.6). 

As shown in [35] , the existence of an n-cyclic S T requires that the 
given system be reachable in n steps at all times. However, reachability 
is not a sufficient condition. A necessary and sufficient condition is 
given in the following theorem, which is the time-varying version of 
Theorem 4.12. 

THEOREM 5 . 7 There is a T over A such that S T is n-cyclic with 

generator g = Eu for some ueA m if and only if there are elements u ,u , ... , 

O' 1 ' 

U __ 1 €A such that g ,g ,...,g generate A n , where g. - Eu. and g, = 
n x U x n— X 0 0 1 

D ^ ag i-l* + Eu i for 1 = l,2,...,n-l. Given u^u^ . . . ,u n _ x and g 0 ,g lf . . . ,g n 


P _1 (D— EB) (CTP) = 


0 

0 

0 

-b 


1 

0 

0 


o 


0 

1 


-cf 2 b_ 


0 0 

0 . . . 0 

1 

-n+1. 


n-1 
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I 


satisfying these conditions, T can be taken to be the matrix - [u n ,u„ , . . . ,u , ,0] 

1 2 n— i 

[ (ag ) , (ag .),..., (ag )] L . 

0 1 n-i 

The computation of the u_^ and g^ can be carried out using a procedure 
corresponding to the one given in Section 4.4 for the time-invariant case. 

An example will be given shortly. 

Now suppose that we have a T and a g = Eu, such that S T is n-cyclic 
with generator g. Given b Q ,b^, . . . ,b n _^eA, we shall construct a feedback 
matrix B satisfying (5.6). First, let U = [g,S T ,...,S^ 1 g] and define 

d = [d 0 d 1 ... d n _ L r = -U _1 (s^g) 

Let S T denote the s.l.t. on A n defined by S^(v) = D t (ov) where D T 
by (5.5) with the a^ equal to the d^. Then by Proposition 5.6, 
where 

p = U(U 1 ) = [g,S T g,...,sJ _1 g] [5 ,S t 5,..., (S^,) 11 " 1 ^" 1 (5.7) 


is given 

= P _1 (D-ET) (CTP) 


Let 


B 


T -uq(aP) 


(5.8) 


where 


q = [d -b 
0 0 


d -b 
1 1 


d -b 
n-1 n-1 


] 


Then [35]P 


(D-EB) (aP) is equal to the matrix given in (5.6) . 
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5 . 4 Example . 

Consider the time-varying system given by 

x^(t) = t x^Ct-Ut (t+l)x 2 (t-l) + t u^Ct-l) 
x 2 (t) = x 2 (t-l) + u 2 (t-l) 

For this example. 



t 

t+1 


t 

0 

D = 

0 

1_ 

and E = 

_0 

1 


We shall first construct a matrix T such that S T is 2-cyclic with generator 

g = Eu. Following the time-varying version of the procedure given in 

2 

Section 4.4, choose u^,u^cA so that [EUq,S(Eu^) + Eu^] is invertible over A. 
A solution is 


u 


0 



and u^ = 


-1 

0 


Then 


T = - 





1 


0] 


This gives 


D - ET = 


2t 

0 


1 


1 
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By construction, s is 2-cyclic with generator g = Eu„ 
T 0 

Now let 1 s compute a feedback matrix B such that 

P _1 (D - EB) (cP) = 


0 1 
0 0 


We have that 


U = [g,s T g] 


0 

1 


1 

1 


and 


d = -U _1 (S^g) 


2t 

-2t-l 


Then 


D 


T 


0 1 
-2t 2t+3 


which gives 



From (5.7) , 


P =U(U X ) 


1 

-2t-2 


0 

1 


- 64 - 



and from (5.8) 


B = t - uq(aP) where u = 
This yields 


, and q = [2t -2t-3] . 


B 


-1 

4t (t+1) 


1 

2t+3 


Then 


D - EB = 


2t 

-4t (t+1) 


1 

-2 (t+1) 


By construction, the free system x(t) = (D-EB)x(t-l) has the property 

2 

that x(t) = 0 for t ^ t +2 for any initial state x(t )eK at time t = t . 

o o o 
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